Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task

نویسندگان

چکیده

VQA is an ambitious task aiming to answer any image-related question. However, in reality, it hard build such a system once for all since the needs of users are continuously updated, and has implement new functions. Thus, Continual Learning (CL) ability must developing advanced systems. Recently, pioneer work split dataset into disjoint sets study this topic. CL on involves not only expansion label (new Answer sets). It crucial how questions when deploying systems environments Visual scenes) requiring functions Question types). we propose CLOVE, benchmark On quEstion answering, which contains scene- function-incremental settings two aforementioned scenarios. In terms methodology, main difference between classification that former additionally expanding preventing forgetting reasoning mechanisms, while latter focusing class representation. real-data-free replay-based method tailored VQA, named Scene Graph as Prompt Symbolic Replay. Using piece scene graph prompt, replays pseudo graphs represent past images, along with correlated QA pairs. A unified model also proposed utilize current replayed data enhance its ability. Finally, experimental results reveal challenges CLOVE demonstrate effectiveness our method. Code available at https://github.com/showlab/CLVQA.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Continual Learning with Deep Generative Replay

Attempts to train a comprehensive artificial intelligence capable of solving multiple tasks have been impeded by a chronic problem called catastrophic forgetting. Although simply replaying all previous data alleviates the problem, it requires large memory and even worse, often infeasible in real world applications where the access to past data is limited. Inspired by the generative nature of th...

متن کامل

the effect of teaching reading as whole text task and paragraph task on efl learners speaking

abstract the present study was conducted to investigate the effectiveness of a rather newly developed method in language teaching that is the use of paragraph reading task and whole reading task and their effects on improving efl learner’s speaking ability. to fulfill the purpose of the study, first 90 participants studying their course at sa eedi high school in tehran were chosen by means of ...

15 صفحه اول

Graph Structure Learning for Task Ordering

In many practical applications, multiple interrelated tasks must be accomplished in sequential order through user interactions with multiple retrieval, classification and recommendation systems. The ordering of the tasks may have a significant impact on the overall utility (or performance); hence optimal ordering of tasks is desirable. However, manual specification of near-optimal ordering is o...

متن کامل

Replay Scene Based Sports Video Abstraction

Video abstraction can be useful in multimedia database indexing and querying and can illustrate the important content of a longer video to quick browsing. Further, in sports video, replay scene often demonstrates the highlight of the video. The detection of replay scene in the sports video is a key clue to sports video summarizing. In this paper, we present a framework of replay scene based vid...

متن کامل

Scene Graph Parsing as Dependency Parsing

In this paper, we study the problem of parsing structured knowledge graphs from textual descriptions. In particular, we consider the scene graph representation (Johnson et al., 2015) that considers objects together with their attributes and relations: this representation has been proved useful across a variety of vision and language applications. We begin by introducing an alternative but equiv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i1.25208